Quantile based histogram equalization for noise robust speech recognition
نویسنده
چکیده
This paper describes an approach to increase the noise robustness of automatic speech recognition systems by, transforming the signal after Mel scaled filtering, to make the cumulative density functions of the signal’s values in recognition match the ones that where estimated on the training data. The cumulative density functions are approximated using a small number of quantiles. Recognition tests on several databases showed significant reductions of the word error rates. On a real life database recorded in driving cars with a large mismatch between the training and testing conditions the relative reductions of the word error rates where over 60%.
منابع مشابه
A Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition
The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Quite a few techniques have been proposed to improve ASR robustness over the past several years. Histogram equalization (HEQ) is one of the most efficient techniques that have been used to reduce the mismatch between training an...
متن کاملImproved Histogram Equalization (heq) for Robust Speech Recognition
With the rapid development of Intelligent Transportation Systems (ITS), how to provide users with a natural and efficient humanmachine interface is now becoming a crucial issue for driver safety. It is no doubt that speech will be one of the best mediators of human-machine interaction; however, the performance of automatic speech recognition (ASR) always radically degrades when the input speech...
متن کاملEvaluation of Quantile Based Histogram Equalization in Combination with Different Root Functions
This paper presents an evaluation of the RWTH large vocabulary speech recognition system on the Aurora 4 noisy Wall Street Journal database. First, the influence of different root functions replacing the logarithm in the feature extraction is studied. Then quantile based histogram equalization is applied, a parametric method to increase the noise robustness by reducing the mismatch between the ...
متن کاملA new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering
In this paper, a new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering is proposed. The progressive histogram equalization (PHEQ) performs the histogram equalization (HEQ) progressively with respect to a reference interval which moves with the present frame to be processed. The multi-eigenvector temporal ...
متن کاملCompensating Acoustic Mismatch Using Class-Based Histogram Equalization for Robust Speech Recognition
A new class-based histogram equalization method is proposed for robust speech recognition. The proposed method aims at not only compensating for an acoustic mismatch between training and test environments but also reducing the two fundamental limitations of the conventional histogram equalization method, the discrepancy between the phonetic distributions of training and test speech data, and th...
متن کامل